InforLorV4, Main, Exploration, bibRecord, 006804

Une nouvelle architecture de compensation du bruit pour la reconnaissance robuste de la parole

Identifieur interne : 006804 ( Main/Exploration ); précédent : 006803; suivant : 006805

Une nouvelle architecture de compensation du bruit pour la reconnaissance robuste de la parole

Auteurs : Khalid Daoudi ; Murat Deviren

Source :

RBID : CRIN:daoudi04a

English descriptors

KwdEn :
- noise robustness, speech recognition.

Abstract

We present a novel noise compensation architecture which makes no assumptions on how the noise sources alter the speech data and which do not rely on clean speech models. Rather, this new architecture makes the (realistic) assumption that speech databases recorded under different background noise conditions are available. Its main principle is to process individually each database and to construct a parametric representation which describes the variation of acoustic models w.r.t. noise models. This representation is then used during recognition to estimate the acoustic models in the new environment. We evaluate the performance of this new compensation scheme on a connected digits recognition task and show that it can perform significantly better than multi-conditions training, which is the most widely used technique in these kind of scenarios.

Affiliations:

Links toward previous steps (curation, corpus...)

to stream Crin, to step Corpus: 003D88
to stream Crin, to step Curation: 003D88
to stream Crin, to step Checkpoint: 000632
to stream Main, to step Merge: 006B07
to stream Main, to step Curation: 006804

Le document en format XML

<record><TEI><teiHeader><fileDesc><titleStmt><title xml:lang="en" wicri:score="69">Une nouvelle architecture de compensation du bruit pour la reconnaissance robuste de la parole</title>
</titleStmt>
<publicationStmt><idno type="RBID">CRIN:daoudi04a</idno>
<date when="2004" year="2004">2004</date>
<idno type="wicri:Area/Crin/Corpus">003D88</idno>
<idno type="wicri:Area/Crin/Curation">003D88</idno>
<idno type="wicri:explorRef" wicri:stream="Crin" wicri:step="Curation">003D88</idno>
<idno type="wicri:Area/Crin/Checkpoint">000632</idno>
<idno type="wicri:explorRef" wicri:stream="Crin" wicri:step="Checkpoint">000632</idno>
<idno type="wicri:Area/Main/Merge">006B07</idno>
<idno type="wicri:Area/Main/Curation">006804</idno>
<idno type="wicri:Area/Main/Exploration">006804</idno>
</publicationStmt>
<sourceDesc><biblStruct><analytic><title xml:lang="en">Une nouvelle architecture de compensation du bruit pour la reconnaissance robuste de la parole</title>
<author><name sortKey="Daoudi, Khalid" sort="Daoudi, Khalid" uniqKey="Daoudi K" first="Khalid" last="Daoudi">Khalid Daoudi</name>
</author>
<author><name sortKey="Deviren, Murat" sort="Deviren, Murat" uniqKey="Deviren M" first="Murat" last="Deviren">Murat Deviren</name>
</author>
</analytic>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc><textClass><keywords scheme="KwdEn" xml:lang="en"><term>noise robustness</term>
<term>speech recognition</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front><div type="abstract" xml:lang="en" wicri:score="2655">We present a novel noise compensation architecture which makes no assumptions on how the noise sources alter the speech data and which do not rely on clean speech models. Rather, this new architecture makes the (realistic) assumption that speech databases recorded under different background noise conditions are available. Its main principle is to process individually each database and to construct a parametric representation which describes the variation of acoustic models w.r.t. noise models. This representation is then used during recognition to estimate the acoustic models in the new environment.  We evaluate the performance of this new compensation scheme on a connected digits recognition task and show that it can perform significantly better than multi-conditions training, which is the most widely used technique in these kind of scenarios.</div>
</front>
</TEI>
<affiliations><list></list>
<tree><noCountry><name sortKey="Daoudi, Khalid" sort="Daoudi, Khalid" uniqKey="Daoudi K" first="Khalid" last="Daoudi">Khalid Daoudi</name>
<name sortKey="Deviren, Murat" sort="Deviren, Murat" uniqKey="Deviren M" first="Murat" last="Deviren">Murat Deviren</name>
</noCountry>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Exploration

HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 006804 | SxmlIndent | more

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 006804 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     CRIN:daoudi04a
   |texte=   Une nouvelle architecture de compensation du bruit pour la reconnaissance robuste de la parole
}}

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022

	Serveur d'exploration sur la recherche en informatique en Lorraine
	Attention, ce site est en cours de développement ! Attention, site généré par des moyens informatiques à partir de corpus bruts. Les informations ne sont donc pas validées.

Serveur d'exploration sur la recherche en informatique en Lorraine

Une nouvelle architecture de compensation du bruit pour la reconnaissance robuste de la parole

Une nouvelle architecture de compensation du bruit pour la reconnaissance robuste de la parole

Source :

English descriptors

Abstract

Links toward previous steps (curation, corpus...)

Le document en format XML

Pour manipuler ce document sous Unix (Dilib)

Pour mettre un lien sur cette page dans le réseau Wicri